Automatic formatted transcripts for videos

نویسندگان

  • Aasish Pappu
  • Amanda Stent
چکیده

Multimedia content may be supplemented with time-aligned closed captions for accessibility. Often these captions are created manually by professional editors — an expensive and timeconsuming process. In this paper, we present a novel approach to automatic creation of a well-formatted, readable transcript for a video from closed captions or ASR output. Our approach uses acoustic and lexical features extracted from the video and the raw transcription/caption files. We compare our approach with two standard baselines: a) silence segmented transcripts and b) text-only segmented transcripts. We show that our approach outperforms both these baselines based on subjective and objective metrics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

International Journal of advanced studies in Computer Science and Engineering

Many organizations and universities provide distance learning by recording classroom lectures and making them available to students over the Internet. A repository generally contains hundreds of such lecture videos. Each lecture video is typically a more than hour’s duration and is often huge. It is sometimes clumsy for students to search through an entire video, or across many videos, in order...

متن کامل

Mismatch interpretation by semantics-driven alignment∗

This paper describes a method for the alignment of automatically recognized speech transcripts with formatted documents manually derived from the speech recognition results. Novel features of our alignment method are a parametrizable scoring function, an intelligent tokenization system drawing on domain knowledge, and semantic comparisons. The field of application are dictated medical reports p...

متن کامل

Digital Watermarking Technology in Different Domains

Due to high speed computer networks, the use of digitally formatted data has increased many folds.The digital data can be duplicated and edited with great ease which has led to a need for effectivecopyright protection tools. Digital Watermarking is a technology of embedding watermark withintellectual property rights into images, videos, audios and other multimedia data by a certainalgorithm .Di...

متن کامل

TUD-MIR at MediaEval 2011 Genre Tagging Task: Query expansion from a limited number of labeled videos

In this paper we present results of our initial research on genre tagging. We approach the task from information retrieval perspective using a relatively small number of labeled videos in the development set to mine query expansion terms characteristic of each genre. We also investigate which sources of information associated with the videos or extracted from their audio channel, e.g. title, de...

متن کامل

Video Indexing and Automatic Caption Creation

This paper presents the design and implementation of a video indexing and automatic caption creation system. The system is able to extract audio from videos and to get the transcript directly from the audio file using the newly designed audio-to-text engine based on Hidden Markov Model (HMM). Transcripts can be edited and the corresponding time stamps are updated automatically. The video indexi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015